109 results found.
Speech/Written
<Not Specified>,
Language Type:
Multilingual
Languages:
Polish
Availability:
Freely Available
License:
CC-BY
Size:
300M tokens OtherProduction Status:
<Not Specified>
Use:
<Not Specified>
-
Paper title:Polish Parliamentary Corpus
-
Paper track:Short papers (up to 4 pages)
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Maciej Ogrodniczuk | Institute of Computer Science, Polish Academy of Sciences | PL |
| Main Contact | Maciej Ogrodniczuk | Institute of Computer Science, Polish Academy of Sciences | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Polish
Availability:
From Owner
License:
OpenSource
Size:
500000 tokens Production Status:
Newly created-in progress
Use:
Corpus Creation/Annotation
-
Paper title:Manually Annotated Corpus of Polish Texts Published between 1830 and 1918
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Witold Kieraś | Institute of Computer Science, Polish Academy of Sciences | PL |
| Author 2 | Marcin Woliński | Institute of Computer Science, Polish Academy of Sciences | PL |
| Main Contact | Witold Kieraś | Institute of Computer Science, Polish Academy of Sciences | None |
Documentation:
<Not Specified>
Written
Lexicon,
Language Type:
Multilingual
Languages:
English Polish
Availability:
Freely Available
License:
Princeton WordNet Licence (open)
Size:
146606 synsets Production Status:
Newly created-in progress
Use:
Universal lexico-semantic resource, for many applications
-
Paper title:Ruled-based, Interlingual Motivated Mapping of plWordNet onto SUMO Ontology
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Paweł Kędzia | Wroclaw University of Technology | PL | ||
| Author 2 | Maciej Piasecki | Wroclaw University of Technology | PL | Wroclaw University of Science and Technology | PL |
| Main Contact | Paweł Kędzia | Wrocław University of Science and Technology | None |
Documentation:
yes, on the siteLanguage Type:
Multilingual
Languages:
Polish
Availability:
Freely Available
License:
CreativeCommons
Size:
<Not Specified> Production Status:
Existing-used
Use:
POS tagging
-
Paper title:PoliTa: A multitagger for Polish
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Łukasz Kobyliński | Institute of Computer Science Polish Academy of Sciences | PL |
| Main Contact | Łukasz Kobyliński | Institute of Computer Science Polish Academy of Sciences | None |
Documentation:
in progress
Written
Tagger/Parser,
Language Type:
Multilingual
Languages:
Polish
Availability:
Freely Available
License:
GNU GPL
Size:
<Not Specified> <Not Specified>Production Status:
Existing-updated
Use:
Syntactic parsing
-
Paper title:Towards an LFG parser for Polish: An exercise in parasitic grammar development
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Agnieszka Patejuk | Polish Academy of Sciences, Institute of Computer Science | None | ||
| Author 2 | Adam Przepiórkowski | Institute of Computer Science, Polish Academy of Sciences | None | Polish Academy of Sciences, Institute of Computer Science | None |
| Main Contact | Adam Przepiórkowski | Institute of Computer Science, Polish Academy of Sciences | PL |
Documentation:
in Polish
Written
Corpus,
Language Type:
Multilingual
Languages:
Polish
Availability:
Freely Available
License:
CC BY 3.0
Size:
10845 summaries OtherProduction Status:
Newly created-in progress
Use:
Summarisation
-
Paper title:The Polish Summaries Corpus
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Maciej Ogrodniczuk | Institute of Computer Science, Polish Academy of Sciences | PL |
| Author 2 | Mateusz Kopeć | Institute of Computer Science, Polish Academy of Sciences | PL |
| Main Contact | Maciej Ogrodniczuk | Institute of Computer Science, Polish Academy of Sciences | None |
Documentation:
At corpus webpage
Written
Corpus,
Language Type:
Multilingual
Languages:
Polish
Availability:
Freely Available
License:
LGPL
Size:
366000000 <Not Specified>Production Status:
Newly created-finished
Use:
Language Modelling
-
Paper title:Rapid creation of large-scale corpora and frequency dictionaries
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Attila Zséder | <Not Specified> | None | ||
| Author 2 | Gábor Recski | <Not Specified> | None | ||
| Author 3 | Dániel Varga | BME MOKK | None | ||
| Author 4 | András Kornai | <Not Specified> | None | Hungarian Academy of Sciences | None |
| Main Contact | Gábor Recski | MTA SZTAKI | HU |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Polish
Availability:
<Not Specified>
License:
CC
Size:
Over 15,000 synsets <Not Specified>Production Status:
Existing-updated
Use:
Natural Language Processing and Engineering, Artificial Intelligence
-
Paper title:Recent Advances in Development of a Lexicon-Grammar of Polish: PolNet 3.0
-
Paper track:Written
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Zygmunt Vetulani | Adam Mickiewicz University in Poznan, Poland | PL |
| Author 2 | Grażyna Vetulani | Adam Mickiewicz University in Poznań | PL |
| Author 3 | Bartłomiej Kochanowski | Adam Mickiewicz University | PL |
| Main Contact | Zygmunt Vetulani | Adam Mickiewicz University in Poznan, Poland | None |
Documentation:
Publications in English
Written
Lexicon,
Language Type:
Multilingual
Languages:
Polish
Availability:
Freely Available
License:
Princeton WordNet
Size:
96600 synsets Production Status:
Existing-updated
Use:
multi-purpose
-
Paper title:Tools for plWordNet Development. Presentation and Perspectives
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Bartosz Broda | Wroclaw University of Technology | None | Institute of Informatics, Wrocław University of Technology | None |
| Author 2 | Marek Maziarz | Wroclaw University of Technology | None | ||
| Author 3 | Maciej Piasecki | Wroclaw University of Technology | None | Institute of Informatics, Wrocław University of Technology | None |
| Main Contact | Maciej Piasecki | Wroclaw University of Technology | PL | Wrocław University of Technology | PL |
Documentation:
publications
Transcribed speech
Corpus,
Language Type:
Multilingual
Languages:
Polish
Availability:
Freely Available
License:
LGPL-LR
Size:
15364 sentences Production Status:
Existing-updated
Use:
Acquisition
-
Paper title:A Phonemic Corpus of Polish Child-Directed Speech
-
Paper track:Speech
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Luc Boruta | Université Paris 7 | None |
| Author 2 | Justyna Jastrzebska | Université Paris 7 | None |
| Main Contact | Luc Boruta | Univ. Paris Diderot / INRIA | FR |
Documentation:
For the original corpus, see the CHILDES manual (http://childes.psy.cmu.edu/manuals/09slavic.doc)




